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ABSTRACT. In this paper, we continue the study of a left-distributive algebra of elementary embeddings 
from the collection of sets of rank less than A to itself, as well as related finite left-distributive algebras 
(which can be defined without reference to large cardinals). In particular, we look at the critical points 
(least ordinals moved) of the elementary embeddings; simple statements about these ordinals can be 
reformulated as purely algebraic statements concerning the left distributive law. Previously, lower 
bounds on the number of critical points have been used to show that certain such algebraic statements, 
known to follow from large cardinals, require more than Primitive Recursive Arithmetic to prove. Here 
we present the first few steps of a program that, if it can be carried to completion, should give exact 
computations of the number of critical points, thereby showing that hypotheses only slightly beyond 
Primitive Recursive Arithmetic would suffice to prove the aforementioned algebraic statements. 



1. Introduction 

Let V\ be the collection of sets of rank less than A, where A is some fixed limit ordinal. The 
assumption that there is a nontrivial elementary embedding from V\ to V\ is an extremely strong 
large cardinal hypothesis [8]. But once one such embedding is known to exist, more of them can 
be obtained by applying embeddings to each other. If a and b are two such embeddings, then one 
cannot literally apply a to b since b £ V\, but one can apply a to initial segments of 6, so we define 
a(b) to be (Ja<A f Va)- Then a(b) will also be an elementary embedding from V\ to V\. 

Let £\ be the set of all nontrivial elementary embeddings from V\ to V\, equipped with the binary 
operation of application. Elementarity implies that this operation is left distributive over itself: 
a(b(c)) = a(b)(a(c)). Assuming £\ is nonempty, let j be a fixed nontrivial elementary embedding 
from V\ to V\, and let Aj be the subalgebra of S\ generated by j. 

One may also consider the operation of composition of embeddings, in which case one can list 
four laws satisfied by the two operations: 

(1.1) (a o b) o c = a o (6 o c), (a o b)(c) = a(b(c)), a(b o c) = a(b) o a(c), aob = a(b)oa. 

Let Vj be the subalgebra of £\ generated by the fixed embedding j using both application and 
composition. 

Laver [6] has shown that Aj is a free algebra on the generator j with respect to the left distributive 
law, and Vj is free with respect to (1.1). He also showed that Aj has an algebraic property called 
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irreflexivity; this gives a proof that the free left-distributive algebra on one generator is irreflexive, 
assuming an extreme large cardinal hypothesis. The question of whether the large cardinal is 
necessary for this purely algebraic result was resolved by Dehornoy [1], who proved the result 
without using any large cardinals; in fact, Dehornoy's proof goes through in a very weak system of 
arithmetic (Primitive Recursive Arithmetic). 

Any embedding k € Aj maps ordinals to ordinals in a strictly increasing manner, so k{a) > a 
for all a < A; the least a such that k(a) > a is called the critical point of k, and denoted by cr k. 
This a is inaccessible, measurable, etc., and k(x) = x for x € V a [8]. Elementarity implies that 
k'(crk) = a(k'(k)) and k'(k((3)) = k' (k)(k' (f3)); using these rules, we can obtain many ordinals 
below A as critical points of members of Aj. To start with, let kq = cr J an d K> n +i = j( K n) for 
n £ uj; the ordinals n n form a strictly increasing sequence (called the critical sequence of j), and all 
of them are critical points of members of Aj (if K n = crfc, then K n+1 = crj(k)). But these are not 
the only ordinals which occur as critical points. In fact, if F(n) is the number of critical points of 
members of Aj lying below n n , then F grows faster than any primitive recursive function [2]. On 
the other hand, Laver and Steel showed that F(n) is finite [7]. 

Laver [7] has also shown that the algebra Aj has as homomorphic images a sequence of finite 
left-distributive algebras A n which can be defined by explicit construction, without using strong 
hypotheses; they can also be defined by purely algebraic methods. Then the results about Aj 
(in particular, the Laver-Steel theorem) have consequences which are purely algebraic statements 
about these finite structures; it is again natural to ask whether large cardinals are necessary for 
the proof of these statements. This question is currently open, but it is known that at least some 
strength is needed: the statements cannot be proved in Primitive Recursive Arithmetic [3]. 

The statements about critical points mentioned above have natural translations into the context 
of the finite algebras A n ; in particular, one can define the function F(n) in this context. Then the 
statement that F(n), as computed from the finite algebras, is finite for all n can be viewed as the 
algebraic version of the Laver-Steel theorem. A more natural statement equivalent to this one is 
that every term built up from one generator using the left-distributive operation will turn out to 
be nonzero when evaluated in some A n . An interesting particular case: the statement that F(A) is 
finite is equivalent to "There is an n such that 1 * 16 ^ in A n ." One can easily write a computer 
program to search for such an n; this program is known to terminate, but the only known proof at 
present requires extreme large cardinals (for this particular result, a 4-huge cardinal will suffice). 
One can also give algebraic characterizations of the algebras A n which allow one to translate ll F(n) 
is finite" into purely algebraic statements of the form "under the left distributive law, one type of 
equation does not necessarily imply another type of equation." 

The proofs of the lower bounds on F(n) [2] work by producing infinite (or at least very large) 
two-dimensional arrays of embeddings, in which a certain number of embeddings in one column 
generate a much larger number of embeddings in the preceding column, and the embeddings in 
column 1 produce a large number of critical points. A uniform construction of this sort produces 
enough critical points to show that F grows faster than any primitive recursive function. There is 
no reason to believe that this construction produces all of the critical points; in fact, it produces 
256 critical points below K4, while a much more irregular construction gives many more. But even 
the latter construction probably does not produce all critical points in the relevant range. 

The goal of this paper is to give the first steps toward an optimal construction of this type; 
that is, one which will produce all of the critical points of members of Aj. If such a construction 
can be completed explicitly, and if the critical points K n appear at known places in the array, 
then one will be able to give explicit recursive formulas for computing the function F, and hence 
give a constructive proof of the Laver-Steel theorem. Furthermore, if the methods used are simple 
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enough, then they will be translatable into the context of the finite algebras A n , thus showing that 
the algebraic statements above can be proved in a theory much weaker than the large cardinal 
assertions currently needed. In fact, one will probably only need to assert that the recursion used 
to compute F produces a total function, and this might require no more than Primitive Recursive 
Arithmetic together with the statement "the Ackermann function is total." (But if F turns out to 
be even faster-growing than currently known, then a somewhat stronger axiom might be needed.) 

Along the way, we will produce new structural results about the finite algebras A n , as well 
as additional specific results about the lower critical points (for instance, the 256 critical points 
below K4 mentioned above turn out to be the Erst 256 critical points). An interesting point is that 
some of the main results here were discovered through examination of bit patterns from printouts of 
extensive computations in the algebras A n ; this may be the first serious example of computer-aided 
research in the theory of large cardinals. Conversely, the new results led to improved algorithms for 
computing in these algebras, thus allowing the development of software for carrying out experiments 
at a much higher level. 

The results in this paper have a large overlap with results of A. Drapal [4, 5], which were obtained 
independently at about the same time, but from a quite different point of view. 

2. Elementary embeddings and finite algebras 

In this section, we review the relevant definitions and fundamental results concerning the ele- 
mentary embedding algebra Aj, the finite algebras A n (n e w), and the connections between them. 
Most of the results here come from Laver [6,7] and Dougherty and Jech [3]. 

Elementary embeddings and critical points. Section 1 gave a number of the basic definitions 
on this topic (the algebra £\ and its subalgebra Aj generated by a fixed single embedding j, the 
critical point cr k of an embedding k, etc.). We will usually abbreviate application by writing just ki 
instead of k(i). Longer concatenations should be associated from the left: k^k^k^ is short for 
((^1^2)^3)^4, or, using parentheses for application, ki(k2)(kz)(ki) . The differing uses of parentheses 
for application and for grouping (not to mention for parenthetical remarks) do not conflict with 
each other, but it is convenient to remove as many such parentheses as possible. 

For any embedding k € Aj, one can construct an infinite sequence of ordinals by starting 
with the critical point cr k and repeatedly applying the embedding k; this sequence is called the 
critical sequence of k. Since A;(cr k) > crk and k is strictly increasing on ordinals, the critical 
sequence is strictly increasing; since k(cri) = cr (ki), all ordinals in the critical sequence are critical 
points of members of Aj. By a theorem of Kunen (in the form given by Solovay, Reinhardt, 
and Kanamori [8]), the critical sequence of k is cofinal in A. Hence, A is not a 'large cardinal' in 
the strongest sense (it is not even regular, but has cofinality uj), but it is a cardinal, and is a limit 
of critical points, which are extremely large cardinals. 

The set of critical points of members of Aj is countably infinite, since Aj is countable and any 
critical sequence gives infinitely many critical points. (One can show that Aj and Vj give the same 
set of critical points, since every element of Vj can be approximated by an element of Aj [7].) Let 
7o> 7i> 72, ■ ■ ■ be a list of the first 00 members of this set, in increasing order. Then the Laver-Steel 
theorem states that all critical points of members of Aj are in this list. This is a consequence of 
two separate results. The first, due to Laver [7], comes from an analysis of the particular sequence 
of embeddings j {n) defined by = j and j{n+i) = j {n )j (that is, j {n) = jj ...j with n fs); it 
follows from results given later in this review. 

Theorem 2.1 (Laver). For any n > 0, cr j( n ) = where m is the largest natural number such 
that 2 m divides n. 
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The second result was a by-product of inner model research of Steel. The original proof (see 
Laver [7]) is in terms of extenders; a simplified version is given below. When applied with ko = 
k± = ■ ■ ■ = j, the result states that {cr j( n y.n > 1} is cofinal in A; this, together with Theorem 2.1, 
implies that no critical point can lie above all of the 7„'s, so the 7„'s are all of the critical points. 

Theorem 2.2 (Steel). For any ko, ki, . . . G £\, sup new cr(fco^i ■ ■ ■ k n ) = A. 

Proof. For i G E\ and 7 < A, let 5 i)7 be the following set of ordinals: 

Si n = {i(f)(s): f, s G Vy, fa. function whose values are ordinals}. 

Claim. If 7 < A is an inaccessible cardinal, k,i G £\, and cri < 7, then the order type of <Si(fc), 7 
is less than the order type of 5j )7 . 

Proof. Since cri < 7, there must be an ordinal a < 7 such that i(a) > 7. Let G be the function 
defined on pairs (/, s) G V a x with / a function whose values are ordinals and s G domain(A;(/)), 
such that G((f,s)) = k(f)(s); then Sk, a is the range of G. We have |S , fc iQ [ < \V a \ < 7, so there 
is an order-preserving function H : Sk, a — > /3 for some /? < 7. Hence, i{H): Si^),i( a ) is 
also order-preserving. Note that C S i ^k) > i( a )- H 5 £ Si(k) n , then (5 = i(k)(f)(s) for some 

f,sEVy with / a function whose values are ordinals; then 

i (F)(<5)= l ( J ff)(,(A;)(/)( S )) 
= i( J ff)( i (G)((/,,})) 
= i(ffoG)«/, a ». 

Since oG is in V^, i(H)(5) is in S*i i7 . Therefore, i(-ff) gives an order-preserving map from S^fc)^ 
into fl (Sj^, which is a proper initial segment of Si tl [since i(/3) = i({(0, /3)})(0) G ^-y], so the 
order type of S^-j^ is less than that of 5i, 7 - O 

Now let z n = fcpfci . . . k n , and suppose sup ragw cr i n < A. Then, by Kunen's theorem, there is an 
inaccessible 7 < A such that sup ne ^cri ra < 7. Let n be the order type of Si„, 7 ; then the claim 
gives 9 > 9i > d 2 > ■ ■ ■ , contradiction. ■ 

This, along with Kunen's theorem (which can be viewed as a consequence of the Laver-Steel 
theorem, but was needed for the proof of that result), is almost the only place in the development 
here where any real set theory is used; the other arguments used could be performed in an ab- 
stract context where we have objects called 'embeddings,' other objects called 'ordinals,' and some 
operations and relations connecting them which satisfy a few basic axioms [3]. 

Some of the more basic facts about embeddings and ordinals are: each embedding gives a strictly 
increasing mapping from ordinals to ordinals, and k{a) > a for all a; cr k is the least ordinal moved 
by k; k(i(a)) = ki(k(a)) and k(cvi) = ci(ki). These facts can be applied to the initial critical 
points 7„ without using the Laver-Steel theorem. This gives the following results, which are stated 
in a somewhat ugly form for reasons which will be given later. 

Proposition 2.3. Let k,i G Aj and n,m G u. Then: 

• Ifk(j n ) > 7 m , then k(j n+1 ) > j m+ i. 

• Ifcri > "i n and k{^ n ) > j m , then cr (ki) > j m . 

• Ifcri ^ 7„ + i and k(j n ) ^ 7 m+ i, then cr (ki) ^ J m +i- 

• cr k > 7 m+ i if and only if k(j m ) ^ j m +i- ■ 
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If we have embeddings k,i G £\, and we are only interested in determining a limited part of ki, 
then clearly we only need a limited part of i: ki \ Vk(p) = k(i \ Vp). This only requires a limited 
part of k as well, namely k \ V a for any a greater than the rank of i \ Vp. But these limited parts 
are of varying sizes; in general, it is not possible to determine ki \ Vp from k \ Vp and i \ Vp. 

Laver [6] has defined a variant form of 'restriction' that gets around this problem, by suitably 

restricting both the domain and the range. Let kDVp = {(x,y) G Vp x Vp: y G k(x)}. We again 
have kiP\V k (p^ = k(iP\Vp), but now one can also show that, for any limit ordinal f3, kiC\Vp and 

{k o i)(~\Vp depend only on knVp and if\Vp. We can also define the corresponding version of 

p * * 

'agreement up to /3': say that k = k' if k (~l Vp = k'P\Vp. This means that k and k' agree about 

what happens in Vp; in particular, if k(x) G Vp and k = k' , then k'(x) = k(x). Also, if cri > /3, 

then i does not move anything in Vp, so we get k = ik for any k. Applying these general facts to 
the specific case of the ordinals 7 n gives: 

Proposition 2.4. Let k, k', i G Aj and n,m G co. Then: 

• = is an equivalence relation. 

• Ifk = k' , then k(~f m ) > 7„ iffk'(j m ) > -y n . 

• If k = k' , then ki = k'i, k o i = k' o i, and i o k = i o k' . 

• If k = k' and i{^ n ) > lm, then ik = ik' . 

• If a i > j n , then k = ik. ■ 

The finite left-distributive algebras A' n and A n . This subsection is a summary of results from 
sections 3 and 4 of Dougherty and Jech [3]; see that paper for full proofs. 

For each natural number n, the algebra A' n is defined to be the set {1,2,..., 2™} with a binary 
operation * n satisfying the following recursive formulas: 

(2.1a) 2 n * n b = b; 

if a < 2 n , then 

(2.1b) a* n l = a + l; 
if a < 2 n and h < 2 n , then 

(2.1c) a * n (b+ 1) = (a * n b) * n (a + 1). 

This is a valid recursion, because the following inductive condition is maintained: 

(2.2') a* n b>a if a < 2 n . 

We will want to use reduction modulo 2™ to map natural numbers into the set {1,2,..., 2™}, 
rather than the usual {0, 1, . . . , 2™ — 1}; let x mod' 2™ be the unique member of {1,2,..., 2™} which 
is congruent to x modulo 2 n . In particular, mod' 2 n will be 2 n . 

For any fixed a in A' n , consider the sequence a * n 1, a * n 2, . . . , a * n 2 n in A' n . If a = 2 n , this 
sequence is just 1,2,..., 2™. If a < 2 n , then the sequence begins with a + 1, and (by (2.1c)) each 
member is obtained from its predecessor by operating on the right by a + 1; hence, by (2.2'), the 
sequence must be strictly increasing as long as its members remain below 2™. Once 2 n is reached 
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(as must happen in at most 2 n — a steps), the next member will be a + 1 again, and the sequence 
repeats. Therefore, the sequence a * n 1, a * n 2, . . . , a * n 2 n is periodic (as long as it lasts); each 
period is strictly increasing from a + 1 to 2 n . We will refer to the number of terms in each period 
of this sequence as the period length of a in A' n , or the period of a in A' n . (The period of 2 n in A' n 
is2 n .) 

One can prove the following results by simultaneous induction on n: 

Proposition 2.5 [3]. 

(a) The period of any a in A' n is a power of 2; equivalently, a* n 2 n = 2 n for all a. 

(b) The formulas (2.1) hold modulo 2 n in A' n even when a or b is 2 n . 

(c) Reduction modulo 2 n is a homomorphism from A' n+1 to A' n : 

(a b) mod' 2 n = (a mod' 2 n ) * n (b mod' 2 n ) 
for all a, b in A' n+1 . 

(d) For any a < 2 n in A' n , if p is the period of a in A' n , then the period of a + 2 n in A' n+1 is 
also p, and the period of a in A' n+1 is either p or 2p. The period of 2 n in A' n+1 is 2 n . ■ 

Given these properties of A' n , the proof that the left distributive law holds in A' n is a straight- 
forward triple induction. 

Let a be an element of A' n , with period 2 m . Then the left distributive law implies that the map 
b i ^ a* n b is a homomorphism from A' n to A' n , with range S a = {a* n 1, a* n 2, . . . , a* n 2 m }, so S a is a 
subalgebra of A' n . Since A' n is generated by 1, S a is generated by a* n 1. Furthermore, a* n b = a* n b' 
if and only if b = b' (mod 2 m ), so S a is isomorphic to the algebra A' n with elements identified if 
they are congruent modulo 2 m ; but this is just A' m , by repeated application of Proposition 2.5(c). 
Since any element a of A' n is of the form a * n 1 for some a (namely a = a — 1, or 2 n if a = 1), we 
see that any element of A' n generates a subalgebra which is isomorphic to A' m for some m (and the 
isomorphism is just the unique order-preserving bijection between the subalgebra and A' m ). 

One can define an algebra A n which looks just like A' n , except that 2 n is renamed 0. This has 
no effect on the structure of the algebras, but it affects statements referring to the ordering of the 
elements of the algebra. In particular, (2.2') becomes: 

(2.2) either a * n b = or a* n b> a. 

Also, the ordinary mod operation now gives the homomorphism from A n+ i to A n . The element 
satisfies the identities a * n = and * n a = a. 

The algebras A n (or A' n ) have a purely algebraic characterization: A n is a free algebra on one 
generator 1 subject to the left distributive law and the equation 1 = 1 * 1 * ■ • • * 1, where the 
product has 2™ + 1 l's, associated from the left. This is proved in Wehrung [9], with an error that 
is corrected in Drapal [4]. 

Since we have homomorphisms from A n+ i to A n for all n, we can construct an inverse limit 
of the algebras A n ; let A^ be the subalgebra of this inverse limit generated by the element 1. 
Alternatively, one can define A^ to be the set of equivalence classes of words in the generator 1 
using the operation *, where two words are considered equivalent if they evaluate to the same result 
in A n for all n. Then A^ is a countably infinite left-distributive algebra on one generator. One 
can show that is free if and only if every word in the generator 1 evaluates to a nonzero result 
in A n for some n. 

One can define a second operation o n on A n or A' n by the formula 



(2.3) 



a o n b = (a * n (b + 1)) - 1, 
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where the addition and subtraction are performed modulo 2 n . Straightforward inductions show 
that the resulting two-operation algebras, called P n and P' n , satisfy Laver's laws (1.1). One also 
has the same homomorphisms as before from P n +\ to P n , so one can take an inverse limit and then 
let Poo be the subalgebra generated by 1; in contrast to the finite algebras, the infinite algebra P^ 
has elements that are not in (assuming is free). 

Connections between finite algebras and embeddings. The review so far has been rather 
distorted historically. In fact, the finite algebras A n were not discovered until Laver proved that 
certain homomorphic images of the elementary embedding algebra Aj are finite, and determined 
their structure explicitly. This result can now be stated as follows. Recall that, for any limit ordinal 

(3 < A, ki fl V/3 and (koi) n Vp depend only on k n Vp and i n Vp, so the restrictions k fl Vp for k G Aj 
form an algebra which is a homomorphic image of Aj, and hence is left-distributive. 

Theorem 2.6 (Laver [7]). For any n, the algebra {kC\V ln :k G Aj} is finite, of size 2 n , and is 
isomorphic to A' n , with j CiV ln corresponding to 1. Similarly, {kDV ln :k G Vj} has size 2 n and is 
isomorphic to P' n , with j C\V ln corresponding to 1. ■ 

So there is a natural homomorphism from Aj to A' n which sends j to 1. In analogy with the 
natural homomorphisms from A' m (m > n) to A' n , we will use the notation k i-> k mod' 2 n for 
this new homomorphism. When we are working with A n instead of A' n , the homomorphism is 
k k mod 2 n ; this again sends j to the generator 1 (except for n = 0, in which case it sends j 

to 0, of course). The 'identity' restricted embedding, namely k(~)V ln where crk> j n , corresponds 
to 2 n in A' n or in A n . 

The above homomorphisms give a natural homomorphism from Aj onto A^. Since Aj is known 
to be free, we have that A^ is free iff this homomorphism is injective. 

It turns out that, even without assuming the large cardinals necessary to define Aj, one can 
talk about critical points of 'embeddings' purely in the context of the algebras A n . Instead of the 
elementary embedding algebra Aj, we can just use the free left-distributive algebra F{j} on one 
generator j. For any k G J~{j}, we can express A; as a word in the generator j; define k mod' 2 n to 
be the result of evaluating this word in A' n with j set to 1. (Since A' n is left-distributive, two words 
giving the same k G F{j} wm gi ye the same result for k mod' 2 n .) Define k mod 2 n analogously. 
These maps are homomorphisms from T{j} to A' n and A n , and they give a homomorphism from F{j} 
onto Aqo, which is an isomorphism iff A^ is free. Note that j( m ) mod 2 n = m mod 2 n , where the 
right side is the ordinary 'mod' operation on numbers. 

Now make the following formal definitions: cr k > 7„ is defined to mean k mod 2 n = 0; and 
fc(7m) ^ 7n is defined to mean (fcj'(2 m )) mod 2 n = (equivalently, the period length of k mod 2 n 
in A n is at most 2 m ). Also, say that k = k' if k mod 2 n = k' mod 2 n . 

We recalled earlier that A^ is free if and only if, for every k G ^{j}, there is a largest n such 
that k mod 2™ = 0. In this case, we naturally say that cr k = 7„ for this n. Similarly, we say 
that k (7 m ) = 7 n if n is largest such that k{-y m ) > 7„. One can now show [3] that all of the basic 
laws about the critical point and application operations (e.g., cr (ki) = k(cvi)) work just as well 
for these new definitions. 

If we do not assume that A^ is free, then there may be elements k of F{j} for which the formal 
statement cr/c > j n holds for all n; we might say that "cr k lies above all of the 7 n 's." Similarly, 
k(^ m ) might lie above all of the 7 n 's. However, if one is careful about stating assertions so as to 
allow for this possibility, then one can continue to perform the standard manipulations with critical 
points to get valid results about the algebras A n , even without assuming that A^ is free. 
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One way to make sure that this works is to state all such manipulations in terms of the primitive 
notions cr k > j n and k(~y m ) > 7„. This is the reason that Propositions 2.3 and 2.4 were stated in a 
somewhat unnatural way: only these primitive notions are used. It is now not hard to see that the 
proofs in section 7 of Dougherty and Jech [3] work to show that the assertions in Propositions 2.3 
and 2.4 hold in the finite algebra context, even without the assumption that is free. 

One can summarize these results as follows: 

Proposition 2.7. The following results hold both in the context of elementary embeddings and in 
the context of the finite algebras A n and P n : 

• The left distributive law and the identities (1.1). 

• Propositions 2.3 and 2.4- 

• cr k > jn if and only if k mod 2™ = 0. 

• k = k'if and only if k mod 2™ = k' mod 2 n . 

• The mapping k i— > k mod' 2 n is a homomorphism from Aj (or F{j}) to A' n which sends j 
to 1. 

In the finite algebra context, these results are provable without any large cardinal or freeness as- 
sumptions (in fact, in Primitive Recursive Arithmetic). ■ 

Using these facts, one can perform an argument in terms of elementary embeddings and then 
easily translate it into a result about finite algebras. 
For example, consider the following result of Laver. 

Lemma 2.8 (Laver [6, Lemma 5]). For any k, eo, ei, . . . , e n € S \, we have 

/ce ei ...e n = fe(e ei . . . e n ), 
where 6 n is the minimum of ke ei . . . e^cr k) for i < n. 

Proof. Induct on n. We have e n °= ke n , so ke ei . . . e n ° 1 = l( " ^ ke ei . . . e n -i(ke n ). By the 

Q 1 

induction hypothesis, ke^ei . . . e n _i = fe(eoei . . . e„_i), so 

ke ei . . . e n -i{ke n ) = x k(e e 1 . . . e n _i)(ke n ) = fc(e ei . . . e n ). 

Putting these together gives ke^ei . . . e n min ( fce ° ei "- e rir 1 ( crfc )' e ™- 1 ) k(e$e\ . . . e„), which is the desired 
result. ■ 

This argument goes through verbatim in the finite algebra context if A^ is free. If we do not 
assume this, then we have to allow for the possibility that cr/c and/or 0, is above all of the 7 n 's, 
but this causes no difficulties in the argument. Alternatively, one can rephrase the result as "if 
cr k > j r and ke^ei . . . ei(y r ) > 7 m for all i < n, then keoei . . . e n = k(eoe\ . . . e n )" and modify 
the proof accordingly. 

So, for the rest of this paper, we will phrase many results and proofs in the language of ele- 
mentary embeddings, and then draw conclusions about the finite algebras; these conclusions will 
be valid without large cardinal or freeness assumptions as long as we use only the basic methods 
that are justified by Proposition 2.7. One can go in the reverse direction as well; for instance, 
Proposition 2.5(d) implies that, if k G Aj and k mod 2 n+1 > 2", then k(^ m ) ^ 7„ for all m. 

Since we can talk about critical points in the finite algebra context, we can define the function F 
there. In fact n n is the critical point of the embedding = . . . . ■ )), where there 
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are n + 1 j's. So F(n) should be the m such that cr = 7 m ; that is, the largest m such that 
jM mod 2 m = 0. The Laver-Steel theorem in this context is the statement that F{n) exists (as 
a finite number) for all n; that is, for every n there is an m' such that mod 2 m ^ 0. This 
statement is also equivalent to the assertion that is free [3]. 

Using Wehrung's characterization of A n , we can get purely algebraic versions of these statements. 
For instance, ll F(n) is finite" can be restated as: There exists an m > such that the left distributive 
law and the equation j( m+ i) = j do not imply the equation j^j = j. 

Consider a particular n, say n = 4. If F(4) is finite, then its value can be computed; one simply 
has to write a program which evaluates for j = 1 in the finite algebras Ai,A 2l ... 

successively, and stops when it gets a nonzero result. This gives a very straightforward computer 
program which is known to terminate, but the only known proof of termination uses an extreme 
large cardinal hypothesis. (Since extremely large lower bounds on F(4) are known [2], it is hopeless 
to actually run such a program to determine ^(4).) 

The finite algebras in backward form. Let B n be the algebra on the set {0, 1, . . . , 2 n — 1} with 
one binary operation * n defined by the following recursive equations: 0* n x = x, x* n (2 n — 1) = x — 1 
for x > 0, and x * n (y — 1) = (x * n y) * n (x — 1) for x, y > 0. Comparison with the defining equations 
for A' n shows that the map x i— > 2 n — x is an isomorphism between A' n and B n , so results about A' n 
can be translated into results about B n and vice versa. This reversal obviously has no effect on 
the structure of the algebras, but it enables one to see certain numerical patterns that are more 
obscure in the original algebras. 

Let us start with some direct translations of results from A' n . The algebra B n is left-distributive 
and generated by the element 2 n — 1. If x, y € B n and x ^ 0, then x * n y < x. Reduction 
modulo 2 n is a homomorphism from Bn (N > n) to B n . For any nonzero x in B n , the sequence 
x * n 0,x * n 1, . . . is periodic with period length 2 m for some m < n; each period of the sequence 
starts with and strictly increases to x — 1. The set {x * n i: < i < 2 m } is equal to the subalgebra 
generated by x — 1, and this subalgebra is isomorphic to B m . Clearly x is the largest element of 
the subalgebra generated by x. If y is in the subalgebra generated by x, then either y = x or the 
subalgebra generated by y is strictly smaller than that generated by x. 

In addition to the homomorphisms above, B n is literally a subalgebra of Bjy for N > n. (To 
see this quickly, note that the inequality x *n V < x implies that {0, 1, . . . , 2 n — 1} is closed under 
the operation *tv; reduction modulo 2 n is the identity on this set and must send *n to * n , so *n 
matches * n on this set.) Therefore, we can drop the subscript and just write * for the operation 
on B n . 

One can take the inverse limit of the algebras B n , getting the same result as for A' n in reversed 
form. One can also take the union of the algebras B n to get a countably infinite algebra B^ with 
operation * which is still left-distributive. This algebra is quite different from the algebra A^; in 
particular, it is not free, since the subalgebra generated by any element is finite. Since the period 
length of x in B n does not depend on n (it is the least positive I such that x * I = 0), the sequence 
x * 0, x * 1, . . . is still periodic in B^ with the same period. 

Let (x) denote the subalgebra of B^ (or of B n for any n with 2 n > x) generated by x. Reduction 
modulo 2 n maps (x) onto (x mod 2 n ). When one moves from B n to B n+ \, if x mod 2 n+1 = 
x mod 2 n , then (x mod 2 n+1 ) = (x mod 2 n ); if x mod 2 n+1 = (x mod 2 n )+2 n , then, since reduction 
modulo 2 n maps (x mod 2 n+1 ) (whose size is a power of 2) onto (x mod 2 n ), (x mod 2 n+1 ) consists 
of either all or half of the numbers y and y + 2 n , where y is in (x mod 2 n ). In particular, one can 
now prove by induction on n that, if y is in {x mod 2 n ), then for any position at which the binary 
representation of y has a 1, the binary representation of x also has a 1. In particular, this holds 
when y £ (x). It follows that the size of (x) (i.e., the period of x + 1) is at most 2 r , where r is the 
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number of 1 bits in the binary representation of x. Numbers x for which this bound is attained will 
be very useful later (for instance, because the isomorphism from B r to (x) is clear: one uses the 
1 bits of x as a template, and transfers the bits of a given r-bit number to the specified locations, 
filling in 0's elsewhere). 

One can translate the preceding results back into the language of A' n or A n , obtaining the 
following. The map x i— > x + 2" is a homomorphism from A' n to A' n+1 . For any x in A n , if we write 
the binary expansion of x with enough leading 0's to give exactly n bits in all, and r of these bits 
are 0, then the period length of x is at most 2 r ; in fact, for any y, and for any position at which 
the binary representation of x has a 1, the binary representation of (x * y — 1) mod 2 n also has a 1. 
These results could have been proved directly, using results such as Proposition 2.5, instead of via 
the reverse algebras. 

Some of these results can be translated into statements about embeddings and critical points. 
One of these will be useful later: 

Lemma 2.9. Let k be in Aj. If k("f m ) = j n , then k mod 2^ (for N > n) must have a in position 
number n (numbered from the right starting with 0) of its binary expansion. ■ 

It is easy to see that A: (70) = 7 n where n is the position of the rightmost in the binary expansion 
of k mod 2^, if N > n. We will see later (Proposition 6.6) that the position of the next bit of 
k mod N always gives A; (71), but later values fc(7 m ) may skip over bits. 

3. Column 1 

Laver showed that the critical points 70, 71, ... , which were defined to be the first critical points 
of any embeddings in Aj, are actually the critical points of a specific list of embeddings, namely 
the embeddings J(2«), n = 0, 1, . . . . He also showed [7] that j(2™)(7n) = 7n+i- We can put these two 
statements together by saying u j(2 n ) has critical sequence beginning 7„ 1— > 7„+i." Similar notation 
will be used for longer sequences: "k has critical sequence beginning a *— > a' >— > a"" means that 
cr/c = a, k(a) = a', and k(a') = a". 

This sort of 'linked list' of embeddings and critical sequences is a key tool in Dougherty [2]. 
The main construction in that paper is an infinite array of embeddings, in which each column is a 
list of embeddings whose critical sequences are linked in a certain way (which varies from column 
to column); each column is used iteratively to produce the preceding column. Actually, the paper 
gives two constructions of this sort, one in section 2 and the other in section 4; these will be referred 
to here as 'the regular construction' and 'the irregular construction.' For instance, column 1 (in 
both variations of the construction) is a list of embeddings ao,ai,... such that a n has critical 
sequence beginning Q 1— > a n 1— ► a n+ \. Given such a list, one can produce 2 n critical points below a n 
[2, Lemma 2]. 

As stated in the introduction, the goal here is to produce an 'optimal' construction of this type, 
which yields all of the critical points of members of Aj. If this optimal construction is to have a 
column as described above, then we must have C = 7o an d a n = 72™ for all n. The following result 
shows that one can indeed produce a sequence of embeddings having this form. 

Theorem 3.1. For any natural number n, the following are true: 

(a) For any m < n, there exists k in Aj such that k has critical sequence beginning ^ 2 n -2 m + 1 l— * 

72" -2 m l— ► 72" • 

(b) // k mod 2 2 7^ 0, then k(j2 n ) > J2 n + 1 - Furthermore, if also k mod 2 2 > 2 2 , then 

fc(72") > 72"+! • 

( c ) J(2 2 "-i) has critical sequence beginning 70 1— > 72" 1— > 72^+1. 
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Proof. By simultaneous induction on n. The case n = of (a) is vacuously true. 

First, suppose (a) is true for n. We prove (b) for n by downward induction on k mod 2 2 . 

If k mod 2 2 = 2 2 — 1, then the rightmost 2" bits of the binary expansion of k mod 2 2 
are all 1, so at most 2™ of the rightmost 2 n+1 bits are 0. Hence, by Lemma 2.9, the 2 n + 1 
critical points k(-y ), ^(71), . . . , k(^ 2 n ) cannot all be below 72^+1, so k{^ 2 n ) > 72»+*- If we also have 
mod 2 2 " > 2 2 ", then at most 2" — 1 of the rightmost 2 n+1 bits of k mod 2 2 ™ are 0, so we get 
fe(72"_i) > 72«+i and hence fc(72™-i) > 72»+i. 

Now suppose k mod 2 2 " < 2 2 " - 1. Then kj mod 2 2 " = (k mod 2 2 ") + 1 > 0, so £(70) = 
cr (kj) < 72". On the other hand, since k mod 2 2 " / 0, we have cxk < 72", so fc(72"-i) > 72"- 
Therefore, there is a largest m such that k(^2 n -2 m ) > 72", and this m is less than n. Apply (a) 
to get k G Aj such that k has critical sequence beg Hilling r ) / 2 n 2 m ~'~ 1 ' — ^ T2 n 2 m ' — ^ T2 n ■ 

Then 

cr (&fc) = fc(cr fc) = /c(7 2 n_2>™+i), which is less than 7 2 ™ by the minimality of m, so kk mod 2 2 7^ 0. 
Hence, by (2.2), kk mod 2 2 ™ > k mod 2 2 ". Therefore, we can apply the inner inductive hypothesis 
to get kk(^2 n ) > 72™+! • Since 

fc(72«) = k(k(j2"-2™)) = kk(k(^ 2 " -2™)) > kk(72™), 
we have £1(72") > 72»+!- If we also have A: mod 2 2 + > 2 2 , then (2.2) gives kk mod 2 2 + > 
k mod 2 2 + > 2 2 , so the inner inductive hypothesis gives kk{^ 2 n ) > 72«+ x and hence k(j2 n ) > 
7 2 n+i. This completes the proof of (b) for n. 

Next, we prove (c) for n from (b) for n. For any r such that 1 < r < 2 2 " , we can apply Lemma 2.8 

with k = j( 2 2" r ) and eo = ei = ■ • • = j to get j(22™( r +i)) = 3(2 2n r) C?(2 2 "))> where is the minimum 
of j(2 2n r+s) ( cr k) for 1 < s < 2 2 . Since k mod 2 2 = 0, we have cr k > 72™. We can apply (b) to 
get J(2 2 "r+s)(72") > 72"+ x f° r all of the above values of s, so we must have 6 > 72«+i+i- Therefore, 

i(2 2 "(r+i)) 2 = +1 j( 2 2" r ) (i(2 2 ™))- Applying this for all r from 2 2 — 1 down to 1 gives 

7 2 ™ + i + i . 
J( 2 2"+i) = J((22"-1)22")J(2 2 ") 

= J((22"-2)22")J(2 2 ")J(22") 

l2 n + 1 + l 

= J(22")J(22") • • • J(22") (2 times) 

= i(22"-i)i(2 2 ")- 

So crj^ 22 n+i-| and cr (j( 2 2 " -i)i(2 2 ™)) = J(2 2 "-i)(72") are the same if either is below 72«+i+i- But 
Theorem 2.1 gives cr j( 2 2 ") = 72" and crjp 2n +ij = 72^+1, so we must have j(2 2 ™ -i)(l2 n ) = 72«+i- 
Since j(22™-i)(7o) = cr J(2 2 ™) = 72™, i(2 2 "-i) has critical sequence beginning 70 1— > 72™ 1— » 72«+i 5 as 
desired. 

Finally, assuming (a) and (c) hold for n, we get (a) for n + 1 as follows. Note that, since there 
are only 2 n — 1 critical points between 7 2 ™ and 72^+1, which must include the ordinals J(22"-i)(7r) 
for < r < 2 n , we must have j(2 2 " -i)(lr) = 72™+r for < r < 2 n . Now, to get a for n + 1 
and m where m = n, use the embedding j(2 2n For m < n, apply (a) for n and m to get k; then 
i( 2 2™_i)(A;) works for n + 1 and to. ■ 

This result makes it reasonable to believe that, in the ultimate two-dimensional construction 
(if it exists) , column 1 is a sequence of embeddings 00 , a\ , . . . such that a n has critical sequence 
beginning 70 1— > 72™ > 72^+1 ■ However, these embeddings will probably not be precisely the 
embeddings j(22™_i), but instead iterative combinations of embeddings from the next column. 
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4. A THEOREM ABOUT THE FINITE ALGEBRAS A n OR B n 

This section gives a result about the left-distributive algebras A n which will be needed in order to 
extend the results of section 3. It is actually stated and proved in terms of the backward algebra 
(the union of the finite algebras B n ), because the patterns that it describes are clearer in this form. 

The history of this result is interesting. Weaker versions of it, formulated in terms of critical 
points and embeddings in Aj, were conjectured as early as 1986, before the finite algebras A n had 
been discovered; but attempts to prove these results by induction were not successful. Much later, 
computer experiments displaying bit patterns occurring in period sequences in B^ related to these 
conjectures led to the statement of the stronger theorem below, which turned out to be strong 
enough that an inductive proof could be pushed through. 

All variables below (except B and 0) range over nonnegative integers. Recall some number- 
theoretic notations: 2 k \ y means that y is a multiple of 2 k , and 2 k || y means 2 k \ y but 2 k+1 \ y. 

Theorem 4.1. Suppose a = x + 2 mn y, where n is a power of 2, y < 2 n , and 2( m+1 ) n | x. Let 
2 l be the period length of x + 1 in B^, (i.e., the size of (x)), and suppose I < n. Then (a) is 
isomorphic to (2 l+n — 2 n + y), and the isomorphism can be described as follows: for any i, if 
( 2 i+n _ 2" + y + 1) * i = 2 n w + y' where y' < 2 n , then (a + 1) * i = (x + 1) * w + 2 mn y'. 

The main case of interest in Theorem 4.1 (from which the full theorem can be deduced) is the 
case y = 2 n — 1, which is worth stating separately: 

Corollary 4.2. Suppose a = x + 2 mn (2 n — 1), where n is a power of 2 and 2 ( - m+1 ' )n \ x. Let 2 l 
be the period length of x + 1 in B^, and suppose I < n. Then a + 1 has period length 2 l+n , and 
(a + 1) * (2 n i + i') = {x + l)*i + 2 mn i' for all i and all i' <2 n . 

The proof of Theorem 4.1 will be by induction on a. We start by establishing several consequences 
of the induction hypothesis. 

Lemma 4.3. Assume that Theorem 4-1 (and hence Corollary 4-2) holds for the case a < L, y ^ 0. 
If s < s' , 2 s — 2 s < L, and s' — s is less than or equal to some power of 2 which divides 2s, then 
2 s — 2 s + 1 has period length 2 s ~ s (the largest possible). 

Proof. Induct on s' — s; the case s' — s < 1 is trivial. (The period of any x > 1 includes both and 
x — 1, so its length is at least 2; if s' — s = 1, then 2 s ~ s has only one 1 bit, so the period length of 
2 s — 2 s + 1 is at most 2.) If s' — s > 1, let n be the largest power of 2 less than s' — s. Note that 
n | s, so n | (s + n); also, s' — (s + n) < n. Apply the induction hypothesis to see that 2 s — 2 s+n + 1 
has period length 2 s ~( s + n ) . Now apply Corollary 2 with x = 2 s — 2 s+n and m = s/n to get the 
desired result. ■ 

Lemma 4.4. Assume Theorem 4-1 holds for a < L. If x + 2 s — 1 < L, 2 s \ x, and x + 1 has period 
length 2 l , where I is less than or equal to some power of 2 dividing s, then (x + 2 s ) * (2 s i + i') = 
(x + 1) * i + i' for all i and all i' < 2 s (so the period length of x + 2 s is 2 l+s ). 

Proof. Induct on s; the case s = is trivial. If s > 0, let n be the largest power of 2 dividing s; 
then 2n divides s — n. Apply Corollary 2 with m = s/n — 1 to see that x + 2 s — 2 s ~ n + 1 has period 
length 2 l+n and (x + 2 s - 2 s ~ n + 1) * (2 n i + j) = (x + 1) * i + 2 s ~ n j for all i and all j <2 n . Now, 
since I + n < 2n, we can apply the induction hypothesis to get 

(x + 2 s ) * (2 s i + 2 s ~ n j + k) = (x + 2 s - 2 s ~ n + 1) * (2 n i + j) + k 

= {x + 1) * i + 2 s ~ n j + k 
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for all i, all j < 2 n , and all k < 2 s n , which gives the desired result if, given i', we choose j and k 
so that i' = 2 s ~ n j + k. ■ 

Lemma 4.5. If a = 2 n x + y where n is a power of 2, x < 2™, and y < 2 n — 1, then (a + 1) *2 n = 0. 

Proof. This is the finite algebra form of the first sentence of Theorem 3.1(b). ■ 

Lemma 4.6. Assume Theorem 4-1 holds for a < L. If a, x, y, n, m, and I are as in Theorem 4-1, 
a<L, andy <2 n - 1, then (a + 1) 5 2™ = 0. 

Proof. By Theorem 4.1, the period length of a+1 is the same as the period length of 2 l+n — 2 n +y+l, 
which is at most 2 n by Lemma 4.5. ■ 

The preceding lemmas were consequences of Theorem 4.1 that will be useful as applications after 
the theorem is proved (so that the initial assumption becomes redundant). The next two, however, 
are just parts of the theorem itself to be used in the induction. 

Lemma 4.7. Assume Theorem 4-1 holds for a < L. If n is a power of 2, < y' < 2 n , 2( m+1 ) n | x' , 
2 (m+i)n \ X; x > + i p er i d i engt h a t most 2 n , and x' + 2 mn y' < L, then (x f + 2 mn y') * (x + 
2 mn( 2 n - 1)) = x' + 2 mn (y' - 1). 

Proof. Apply Lemma 4.6 to x' + 2 mn (y' — 1) to show that x' + 2 mn (y' — 1) + 1 has period length 
at most 2™. Then apply Lemma 4.4 with s = mn to show that 

(x' + 2™V) * (x + 2 mn (2 n - 1)) = (x' + 2 mn {y' - 1) + 1) * (x/2 mn + 2 n - 1). 

But x/2 mn + 2 n -l = 2 n -l (mod 2 n ), so {x' + 2 mn {y' -l) + l)*{x/2 mn + 2 n -1) = x' + 2 mn {y' - 1). 

■ 

n times 

Define x*^ to mean x * x * ■ ■ ■ * x (associated to the left); this is equal to (x + 1) * (K — n) if 
K is a multiple of the period length of x + 1. 

Lemma 4.8. Assume Theorem 1 holds for a < L. If n is a power of 2, 2( m+1 ) n | x, x' is in (x), 
x' < L, x < L, and x + 1 has period length at most 2 n , then 

x'*{x + 2 mn (2 n - 1)) = [x' * x) + 2 mn (2 n - 1). 



Proof. Induct on x. The case x' = is trivial, so assume x' ^ and hence x / 0. Since x' is in (x), 
any 1 bits in the binary representation of x' must correspond to 1 bits in the binary representation 
of x. In particular, this gives x' = (mod 2( m+1 ) n ), so x' * (x + 2 mn (2 n - 1)) = x + 2 mn (2 n - 1) 
for some x such that x < x' and 2^ m+1 " )n \ x. We need to show that x = x' * x. 

Since x / 0, there is a largest number M such that 2 M < x; clearly x < 2 M+1 and M > (m+l)n. 
Now (x mod 2 M ) is a multiple of 2( m+1 ) n less than 2 M , so (x mod 2 M ) + 2 mn (2 n - 1) < 2 M < L. 
Since x' is in (x), (x' mod 2 M ) is in (x mod 2 M ). Therefore, we can apply the induction hypothesis 
to see that 

(x' mod 2 M ) * ((x mod 2 M ) + 2 mn (2 n - 1)) = ((x' * x) mod 2 M ) + 2 mn (2 n - 1). 

Therefore, x = x' * x (mod 2 M ). Since x' * x is a member of (x) and is less than x (in fact, less 
than x'), the period length of (x' * x) + 1 is less than that of x + 1 and hence less than 2 n . Since the 
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period length of x + 1 is at most twice that of (x mod 2 M ) + 1 = ((x' * x) mod 2 M ) + 1, x + 1 has 
period length at most 2 n . Therefore, we can apply Corollary 4.2 to get (x + 2 mn (2 ra — 1))*( 2 ) = x. 

Since x' < 2 M+1 , x' — 1 has at most M 1 bits in its binary expansion, so x 1 has period length at 
most 2 M . We can apply Corollary 4.2 to get ((x mod 2 M ) + 2 mn (2 n - 1))*( 2 ") = (x mod 2 M ), so 
(x + 2 mn (2 n - 1))*( 2 ") = x (mod 2 M ). Finally, by the distributive law, 

x= (x + 2 mn (2 n - 1))*( 2 ") 
= (x'* {x + 2 mn {2 n - 1)))*( 2 ") 
= x' * (x + 2 mn (2™ - 1))*( 2 ") 
= x' * X, 

as desired. ■ 

Proof o/ Theorem J^.l. We will show that, if Theorem 4.1 holds for a < L, then it holds for 
a < L + 1. Suppose a, x, y, m, n, and Z are as hypothesized, and a < L. First, consider the 
case x = 0. We show by induction on z that 2 mn z * 2 mr V = 2 mra (z * z') for < y. This is 
trivial for z = 0. For z > 0, we have 2 mra z * 2 mn z' = (2 mn (z — 1) + 1) * z' by Lemma 4.4, and 
(2 mn (z - 1) + 1) * z' = (2 mn (z - l))*(2"-z') = 2 mn (z - l)*( 2 "-^') = 2 mn (z * z') by the induction 
hypothesis on z. This completes the induction on z. So, in particular, (y) is isomorphic to (2 mn y) 
via the map z t— > 2 mn z, which is the desired result. 

Second, we handle the case where both x and y are nonzero. This implies x < L and 2 i+n — 2 n < 
L, so we can apply Lemma 4.3 to see that 2 l+n — 2™ + 1 has maximal period length 2 l ; that is, 
2 n (i + 1) * (2' +n - 2 n ) = 2 n i for all t < 2 1 - 1. Hence, (x) and (2 l+n - 2 n ) are isomorphic, with 
the isomorphism (p mapping (x + 1) * w to 2 n w for w < 2 l . This implies that (x + 1) * w + 1 and 
2 n w + 1 have the same period length for each such w. 

The conclusion of Theorem 4.1 certainly holds for infinitely many i, namely for all common 
multiples of the period lengths of a + 1 and 2 l+n — 2 l + y + 1 (since this gives [a + 1) * i = = 
(2' +n — 2' + y + 1) * i). Therefore, it will suffice to show that, if the conclusion holds for i + then 
it holds for i. 

Suppose the conclusion holds for i + 1. If (2 l+n - 2 n + y + 1) * (i + 1) = 0, then (a + 1)5 (i + 1) = 0, 
and hence (2 l+n — 2 n + y + \) * i = 2 l+n — 2 n + y and (a + 1) * i = a, so the conclusion holds 
(with w = 2 l - 1 and y' = y). So suppose {2 l+n - 2 n + y + 1) * (i + 1) = 2 n w + y' ± 0; then 
(a + 1) * (i + 1) = x' + 2 mn y', where x' = (x + 1) * iw. 

Write 2 n w + y' - 1 = (2 n w + y') 5 (2 l+n - 1) as 2 n u; + y with y < 2 n . If y' > 0, then w = w and 
y = y' — 1; if y' = 0, then w = u> — 1 and y = 2 n — 1. Now Lemma 4.7 (if y' > 0) or Lemma 4.8 (if 
y' = 0) gives (x' + 2 mn y') * (x + 2 mn (2 n - 1)) = x + 2 mn y, where x = (x + 1) * w. 

As noted before, the isomorphism 4> ensures that (x + l)*u) + l and 2 n w + l have the same period 
length; call this length 2 l . Clearly I < I. We can now apply the main induction hypothesis to both 
x + 2 mn y and 2 n w + y to see that they both generate subalgebras isomorphic to (2 l+n — 2 n + y) 
and hence to each other, with the isomorphism (f) mapping (x + 1) * v + 2 mn z to {{2 n w + 1) * v) + z 
for all valid values of v and z < 2 n (i.e., for all v and z such that 2 n v + z is in (2 l+n — 2 n + y)). 

We have 2 l+n - 2 n + y = (2 i+n - l)^ 2 "-^. We next show that x + 2 mn y is congruent to 
(x + 2 mn (2 n - l)) s ( 2 "-w) modulo the period length of x' + 2 mn y'. Since x + 0, let 2 M be the largest 
power of 2 less than or equal to x. Then we can apply Corollary 4.2 to (x mod 2 M ) + 2 mn (2 n — 1) 
to get 

((x mod 2 M ) + 2 mn (2 n - l)) 5 ( 2 "-») = (x mod 2 M ) + 2 mn y. 



CRITICAL POINTS IN AN ALGEBRA OF ELEMENTARY EMBEDDINGS, II 



15 



Since x' + 2 mn y' — 1 is less than 2 M+1 — 1, it has at most M 1 bits in its binary expansion, so the 
period length of x' + 2 mn y' is at most 2 M , so we have the desired congruence. 
Now we can write 

(2 l + n - 2 n + y + 1) * i = (2 n w' + y') 5 (2 l+n - 2 l + y) 

= (2 n w' + y')*(2 l+n _i)*( 2 "-w) 
= {{2 n w' + y') * {2 l+n - \)f^ n -y) 
= (2"u> + #) 5 < 2 "-»>, 



(a + l)*i = (x' + 2 mn y') * [x + 2 mn y) 

= [x' + 2 m V) * [x + 2 mn {2 n - l)f( 2 ""f) 
= {{x' + 2 mn y') *(x + 2 mn (2 n - i)))*(z n -y) 
= (x + 2 mn y)* (2n - y '>. 

The isomorphism (f> sends x + 2 mn y to 2 n w + y, so it sends (2 l+n — 2 n + y + 1) *i to (a+ 1) *«; hence, 
there must exist numbers v and z with 2 < 2 n such that (2 l+n — 2 n + y + 1) * i = + + z and 
(a + 1) * i = (2 n w + 1) * v + z. The isomorphism sends x to 2 n w), so it sends {x + 1) * v = £*( 2 
to (2 n w + l)*v = (2 n w)*( 2 ~ v \ so there must be a number u < 2 l such that (x + l)*v = (x + l)*u 
and (2 n w + l)*t; = 2 n u. Therefore, (a + l)*i = (x + l)5n + z and (2* +n - 2 n + y + 1) * i = 2 n u + z, 
which is the desired result for i. 

This completes the case when x and y are both nonzero. Finally, we consider the case y = 0. In 
this case, all that needs to be shown is that 2 l+n — 2 n + 1 has maximal period length 2 l . But this 
follows from Lemma 4.3, which we can apply now since we have proven the Theorem for a < L + 1 
and nonzero y. This completes the proof for a < L + 1, so the induction is complete. ■ 

5. Applications to algorithms for computing in ^ and B n 

As noted earlier, computer experimentation in the finite algebras led to the formulation of 
Theorem 4.1. We will now see that this theorem leads to new algorithms which allow such exper- 
imentation to proceed much farther in the sequence of finite algebras. Again we will work with 
the backward algebras Bn; but this clearly gives algorithms for as well, since the isomorphism 
between Ajv and is just a trivial two's complement. 

Since B^ is defined by recursive formulas, one can easily write down the obvious recursive 
algorithm for computing x * y in B^. However, such an algorithm would take a very long time 
even for small values of N, since very many intermediate products x' * y' are needed, and most of 
them are needed more than once. Alternatively, one could precompute the entire multiplication 
table for Bn once relatively quickly and store it, but this requires too much storage space if iV is 
much more than 10. 

In order to compute x*y for x,y <G Bn, where (the case x = is trivial), it suffices to 

know the entire period sequence of x; if this period has length 2 l , then x * y is just element number 
(y mod 2 l ) in the period. 

Since the period is strictly increasing, there is a threshold t < 2 l such that, for < y < 2 l , we 
have x*y> 2 Ar_1 if and only if y > t. Hence, if we know I and t, we can immediately read off the 
2 Ar_1 's bit of x * y for any y; it is 1 if y mod 2 l > t, otherwise. Note that t > 0, since x * = 0. 
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If the 2^ 1 's bit of x — 1 is 0, then the 2^ 1 's bit of x * y must be for any y, so t = 2 l ; if the 
2 Ar " 1 's bit of x * y is 1, then < t < 2 l . 

To get the 2 m 's bit of x * y for m < N — 1, we just have to reduce everything modulo 2 m+1 and 
work as above in B m+ \. Hence, the entire period of x can be described by numbers l(m) and t(m) 
for < m < n; the 2 m 's bit of x * y is 1 iff y mod 2^ m ) > t(m). 

If the 2 m 's bit of x — 1 is 0, then one does not need to keep track of l(m) and t(m), because l(m) 
is l(m — 1) (0 if m = 0) and t{m) is 2 l(jn \ Also, if x' — 1 = (x — 1) mod 2 n , then the values l(m) 
and t(m) for x' are the same as those for x for all m < n' . This means that, in order to compute 
x * y for all x and y in B^, it suffices to have a pair of numbers l(x) and i(x) for each x > 2 in Bjy, 
namely the numbers Z(m) and t(m) for x, where m is the position of the topmost 1 bit in x — 1. 

The values of t(m) and t(x) are not arbitrary, as we will show from the following lemmas. 

Lemma 5.1. If x has at most two 1's in its binary expansion, then x + 1 has maximal period 
length in B^ (1, 2, or A if x has zero, one, or two 1's, respectively). 

Proof. The case of zero 1 bits is trivial. If x has exactly one 1 bit, then the period of x + 1 has 
length at most 2 but includes both and x, so the period length is exactly 2. If x has exactly two 
1 bits, suppose x = 2 M + 2 m with M > m; then 2 M + 1 has period length 2 by the above, so x + 1 
has period length 4 by Corollary 4.2 (with n = 0). ■ 

Corollary 5.2. If x has at least two 1's in its binary expansion, then x + 1 has period length at 
least 4. 

Proof. Apply Lemma 5.1 to x mod 2 m+1 , where m is the position of the second-to-last 1 in the 
binary expansion of x. ■ 

Lemma 5.3. For any x and n, x * 2 n is or a power of 2. 

Proof. We have (x * 2 n ) * (x * 2 n ) = x * (2 n * 2 n ) = x * = 0, so, by Corollary 5.2, x * 2 n has at 
most one 1 bit. ■ 

Lemma 5.4. If l(m) and t(m) are as defined above for x, and m > 0, then either the period of x 
doubles at m (i.e., l(m) = l(m — 1) + 1) and t(m) = 2^ m ) _1 , or the period of x does not double 
at m (i.e., l(m) = l(m — 1)) and t(m) > 2 Z(m ) _1 . 

Proof. The first case is clear, since the period of (x mod 2 m+1 ) + 1 is the period of (x mod 2 m ) + 1, 
followed by that same period with 2 m added to each number. For the second case, it will suffice to 
show that, if l(m) = l(m - 1) > 0, then (x mod 2 m+1 ) * 2 l ^~ 1 < 2 m . But (x mod 2 m+1 ) * 2 l{m ^~ 1 
must be or a power of 2 by Lemma 5.3, and it must be nonzero modulo 2 m because (x mod 2 m ) * 
2 /(m)-i ^ 0j go ^ mus t be less than 2 m . ■ 

So, if the 2 m 's bit of x - 1 is 1, then 2 l ^~ 1 < t(m) < 2 l{m \ so l(m) is the least r such that 
2 r > t(m). 

Similar statements can be made if one is working in Am rather than Bm- Again periods are 
strictly increasing and of length a power of 2, so a complete period can be specified by numbers 
l(m) and t(m) for m < N. However, the maximal period length for x is now the number of 0's 
in the expansion of x as an TV-bit binary number, rather than the number of 1's in x — 1. If 
there are at most two such 0's, then x has maximal period length; if there are at least two, then 
the period length of x is at least 4. Lemma 5.3 now states that the midpoint of a period must 
be of the form 2^ — 2 m for some m, and Lemma 5.4 states that the threshold t(m) must satisfy 
1 < t{m) < 2'( m ) _1 . Note that now t{m) = 2 Z(m ) _1 if the period has just doubled, but we can also 
have t(m) = 2 Z(m ) -1 even if the period has not doubled. 
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To compute in Bjy, it suffices to know the values (l(x) and) t(x) for all x > 2 in Bjy. One can 
precompute these values recursively; knowing l{y) and t{y) for y < x allows one to do repeated 
multiplication by x — 1 in order to generate the entire period of x, from which l(x) and t(x) can 
be determined. Storing these precomputed values in a file is reasonable for N up to 20 or slightly 
more, so the above gives a reasonable algorithm for computing in for such N. 

An algorithm that will handle larger algebras is based on the following lemma. 

Lemma 5.5. Suppose 2 N — 2 s + 1 has maximal period length 2 N ~ S in B^. Then, for any x < 2 N ~ S , 
if the period length of x + 1 is 2 l , then the period length of 2 s (x + l) is 2 l+s and 2 s (x + l)*(2 s i+j) = 
2 s ((x + for all i < 2 l and j < 2 s . 

Proof. Since 2^ — 2 s + 1 has maximal period length, (2^ — 2 s ) is isomorphic to (2 N ~ S — 1), and 
the isomorphism maps 2 s w to w for all w, so 2 s w * 2 s w' = 2 s (w * w') for all w,w' < 2 N ~ S . 

In particular, for any w < 2 N ~ S , we have (2 s w + 2 s ) * (2^ - 1) = 2 s w + 2 s - 1 and (2 s w + 2 s ) * 
(2^ — 2 s ) = 2 s w. Since the period of 2 s w + 2 s is strictly increasing, and its length is at least 2 s (as 
one can see by reducing modulo 2 s ), we must have (2 s w + 2 s ) * (2 N — z) = 2 s w + 2 s — z for all z, 
1 < z < 2 s . In particular, this gives (2 s w + j + 1) * (2 s w + 2 s - 1) = 2 s w + j for j < 2 s - 1. 

If x < 2 N ~ S and x' is in (x), then 2 s x' is in (2 s x). We have 2 s x' * (2 s x + 2 s - 1) = 2 s x + 2 s - 1 
for some x, and the method of Lemma 4.8 lets us show that x = x' * x: 

2 s x = (2 s x+2 s -iy^ = (2 s x'*(2 s x+2 s -l))*^ = 2 s x'*(2 s x+2 s -1)*^ = 2 s x'*2 s x = 2 s {x'*x). 

Also, since x 1 is in (x), its binary representation has l's only where that of x has l's, so 2 s x' + 2 s — 1 
ends in no more 1 bits than 2 s x + 2 s — 1 does, so 2 s x + 2 s is divisible by at least as large a power of 2 
as 2 s x' + 2 s is. This means that, for any j < 2 s - 1, since (2 s x' + j + 1) * (2 s x' + 2 s - 1) = 2 s x' +j, 
2 s x' + 2 s is divisible by the period length of 2 s x' + j + 1, so 2 s x + 2 s is divisible by this period 
length, so (2 s x' + j + 1) * (2 s x + 2 s - 1) = 2 s x' + j. 

Given the above facts, an easy downward induction on a < 2 N shows that, if a = 2 s i + j with 
j < 2 s , then 2 s {x + 1) 5 {2 s i + j) = 2 s ((x + 1) * i) + j. U 

Now suppose that we want to compute the period of a + 1 in B^, and we have a positive s such 
that 2 N — 2 s + 1 has period length 2^"*; suppose also that we already know the period of b + 1 
for all b such that 2 N — 2 s < b < 2^. Write a in the form 2 s x + y where y < 2 s . If x = 0, then we 
can get the period ofa+l = y + l from the period of 2^ — 2 s + y + 1 by reducing modulo 2 s (i.e., 
just take the values l(m) and t(m) for m < s), so assume x > 0. 

Let ai = 2 s x + 2 s — 1, and let 2 l be the period length of x. Then, by Lemma 5.5, the period 
length of a\ + 1 is 2 l+s , and there is an isomorphism </>i between {2 l+s ) and {a\) defined by the 
formula 4>i{2 s i + j) = 2 s {{x + 1) * i) + j for i < 2 l and j < 2 s . Knowing such an isomorphism is 
equivalent to knowing the period of a\ + 1, since (ai + 1) * b = <p\{b mod 2 l+s ). 

Let a 2 = 2 i+s — 2 s + y; then ^1(02) = a, so a 2 + 1 and a + 1 have the same period length. If 
2 l is this common period length, then there is an isomorphism $2 from (2 l — 1) to (02). Hence, 
4>i 4>2 gives an isomorphism between (2 l — 1) and (4>\{a2)) = (a), so the isomorphisms <pi and </>2 
(i.e., the periods of a\ + 1 and 02 + 1) determine the period of a + 1. Another way of looking at 
this is that a = (ai + 1) * 02, so a + 1 = (ai + 1) * ((02 + 1) — 1) + 1, and the right hand side of this 
equation is just the formula for (a\ + 1) o (02 + 1) in the backward version of the algebra Pm- 

We can determine 4>2 from the (known) period of 2^ — 2 s + y + 1 by just keeping those values 
l(m) and t{m) for m < I + s. To determine (f>\ it suffices to determine the period of x + 1, and this 
can be done by a recursive call to this entire procedure. (Since x < a /2 s , each level of recursion 
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reduces the argument by a factor of 2 s , so not many levels of recursion are needed.) Finally, given 
4>i and 02, one can compose them to get the period of a. 

Actually, getting from the ^(m),tj(m) forms of 4>i (i = 1,2) to the l(m),t(m) form of 4>i ° 4>2 
does not seem to be trivial. The I part is easy: l(m) = hihim)). But computing t{m) from 
h, h,ti,t2 seems to require some tricky Boolean and arithmetical manipulations; no general formula 
seems apparent. There are some easy special cases: if ti(m) = 

2h(m) > then £( m ) = 2 1("0; if 

ti{m) = 2' 1 ( m ) _1 (the period doubling case for 4>i), then t(m) = t2(h(m) — 1); if l\(m) = li(m — 1) 
and ti(m) = t\{m — 1), then t(m) = t(m — 1). It turns out that most m's fall under these cases; 
for those which do not, the easiest approach seems to be to do a binary search to find the largest 
u < 2 1 ^ such that </>i(0 2 (u)) < 2 m , and let t(m) = u + 1. 

Therefore, we have an algorithm for computing in which only requires storing period data 
for the last 2 s members of B^, if 2^ — 2 s + 1 has maximal period length. Lemma 4.3 gives many 
cases in which this holds: if s is a power of 2, then any value of N up to 3s is permissible. In 
particular, one can work in B±% if one has stored period data for the last 2 16 members of B^%. 

This leaves the problem of generating such period data to be stored and used. Fortunately, there 
is an efficient algorithm for producing such data by recursion on N. The case N = is trivial. 
Now suppose we want the data for iV and s, and we have such data for N — 1 and s. The data for 
N — 1 and s suffice to do operations (a + 1) * b in Bn whenever a < 2 N — 2 s , because the value of a 2 
arising in the algorithm above is less than 2 l where I is at most the number of 1 bits in the binary 
expansion of ai, and a < 2 N — 2 s implies a\ < 2 N — 1. Now we can simply proceed to compute 
the entire period of 2 N — 2 s + y + 1 for y = 0, 1, . . . , 2 s — 1 successively, by iterated multiplication 
by 2 N — 2 s + y. (The cases y = and y = 2 N — 1 should be handled separately, because they give 
very long periods which are already known.) 

The algorithms described here have been implemented in C++, and work well for N = 48 (using 
a 2.7-megabyte period data file). Increasing iV beyond 48 would require increasing s from 16 to 32, 
and hence would require a period data file many gigabytes long, so we seem to have reached the 
limits of this algorithm. 



6. Applications to critical point computations 

First, we translate some of the results about finite algebras into embedding terms. In this 
section, letters e through k will denote members of Aj or F^}, while other letters will denote 
nonnegative integers. Also, we use the notation k + n to denote k applied to j n times: k + = k, 
k + n + 1 = (k + n)(j). (So + m = j( n + m )- This matches the notation from ^4^, but is the 
reverse of that from B^.) 

Lemma 6.1. Suppose n is a power of 2 and cr k > 7( m +i) n . Then, for any y such that 1 < y < 2 n , 
we have k + 2 mn y ( k + 2(m+ =~ 1 )^) ^ + 2mn _ . equiva i ent iy ; k + 2 mn y - 1 ( k+2(m+1 l"~ 1)iln) 

(k + 2 mn — 1) oj( J ,_ 1 ). 

Proof. Let N be such that j N = (k + 2 (m+1 ^ n - 1)(7„), and apply Corollary 4.2 with x set to 
the member of B^ corresponding to k + 2^ m+l "> n . To get the second form, use the fact that 
e*Arl = e'*Arl implies e = e' in and hence in Pn . ■ 

Lemma 6.2. // 2 n | s, then j(2 s -i){lm) = ls+m for all m < 2 n+1 . 

Proof. Apply Lemma 4.3 with s' = s + 2 n+1 . ■ 
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Lemma 6.3. Suppose n is a power of 2, s is a multiple of n, and crk>^/ s . Then, for any y such 
that 1 < y < 2 s , we have k + y + = 7 ™ k(j^); equivalently, k + y — 1 + = 7 ™ k o j( y -\). 

Proof. Let N be such that 7at = (k + 2 s — l)( 7 n), and apply Lemma 4.4 with x set to the member 
of Bm corresponding to k + 2 s . ■ 

Corollary 6.4. If n, s, and k are as in Lemma 6.3, then k(j n+s ) > (k + 2 s — l)( 7n ). 

Proof. Let S = (k+2 s -1)(7„); then, by Lemma 6.3, k+2 s -l = koj {2a _ iy Since (k+2 s -1)(7„) > S, 
we must have (k o j(2 s -i))(ln) > S. Lemma 6.2 gives J(2 s -i)(7n) = 7n+s, so we have /c(7„_|_ s ) > 5, 
as desired. ■ 

Lemma 6.5. If n is a power of 2, cr k > 7( m +i) n , and 2 < z < 2 n , then (k + 2 mn z — l)( 7n ) > 

Proof. Let iV be such that 7^ = (k + 2( m+1 ) n — 1)(7„), and apply Lemma 4.6 with x set to the 
member of Bn corresponding to k + 2^ m+1 ^ n to get the first part. For the second part, note that 
(k + 2(™+ 1 )" - l)( 7o ) > 7 (m+ i)n and hence (k + 2^+^ - l)( 7n ) > 7( m +2)n- ■ 

Proposition 6.6. If k mod 2^ has at least one in its N-bit binary representation, then ^(70) = 
7 n where n is the position of the rightmost such (i.e., n is least such that k mod 2 n+1 < 2 n ). If 
k mod 2^ has at least two O's in its N-bit binary representation, then k^i) = j n > where n' is the 
position of the second-rightmost such 0. 

Proof. Apply Lemma 5.1 with x + 1 equal to the element of Bn corresponding to k. ■ 

Next, we give an extension of Lemma 6.1 which is a variant of Lemma 2.8. 
Lemma 6.7. Suppose n is a power of 2 and crk> 7 ( m +i) n . Then, for any y such that 1 < y < 2 n , 
we have k + 2 mn y = (k + 2 mn - l)(J( y )), where 6 y = mm 1<z < y (k + 2 mn z - 1)(7„). 
Proof. Induct on y. The case y = 1 is trivial, so suppose y > 1. We then have 

(fc + 2 mn - l)(j (y) ) = {k + 2 mn - l)(j(y-i))(k + 2 mn ) V (k + 2 mn {y - l))(fc + 2 m "). 

We have k + 2 mn 1(m ± 1>n j (2ran) , so 

(fc + 2 m "(y - l))(fc + 2 mn ) ( fc+2m "^- : i^<™+ 1 >") (fc + 2 mn (y _ i))(j (2raB) ). 

Apply Lemma 6.3 with s, fc, and y replaced by urn, /c + 2 mn (y — 1), and 2 mn respectively to get 
fc + 2 mn y 1 = n7 " ; (A; + 2 mn (y - l))(j (2 —))- Also by Lemma 6.3, we have 

fe + 2 y-1 = (fe + 2 {y - 1)) oj (2ro „_ 1); 

since Lemma 6.2 gives j( 2 ™™-i) (7n) = 7(m+i)m we get 

( fc + 2™(y - l))(7 (m+ i)n) = ((* + 2""% - 1)) o j (2 ^_ 1} )(7n) > (fc + 2™y - l)( 7n ). 
Therefore, 6 y is the minimum of 6 y -i, (k + 2 mn (y — l))(7( m +i) ra ), and (k + 2 mn y — l)(7 n ), so 
k + 2 mn y = (k + 2 mn - l)(j(y)), as desired. ■ 

This works even if k is the identity (which is thought of as having critical point greater than 
any 7jv), so that k + r is just j( r ). 
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7. Column 2 

Column 2, in both the regular construction and the irregular construction, is a list of embeddings 
bo,bi,. . . such that b n has critical sequence beginning £' i— > /?„ i— > /3 n +i, where we also have available 
an embedding z such that z has critical sequence beginning ( i— > i— ► /3q- Given a list of this type, 
one can generate another list of the type suitable for column 1 , which results in the production of 
2 2 " critical points below (3 n [2, Lemma 3]. Hence, if such a list is to be a column of the 'optimal' 
construction, we must have (3 n = 7 2 2« (and ( = 70, C' = 7i)- We will see in this section that a 
suitable sequence of embeddings does exist. The proof will be similar to that of Theorem 3.1, but 
will require the results of the previous section, as well as the following lemma. 

Lemma 7.1. Given k £ Aj, let f be the numerical mapping associated with k, so that k{^ n ) = 
7/( n )- Then f(n) > /(0) + (/(l) — /(0))n for all n. Furthermore, if we have f(n) = /(0) + re(/(l) — 
/(0)) for a particular n, then f(n') = /(0) + (/(l) - f(0))n' for all re' < n, and k mod 2- f ( n )+ 1 < 

Proof. As mentioned at the beginning of section 3, for any rre > 0, ,?( 2 m) has critical sequence 
beginning 7 m 1— > 7 m +i- But j has critical sequence beginning 70 > 71, and j( 2 m) = i(2 m -i)(j)) so we 
must have i( 2 --i)(7o) = 7m and j( 2 ™-i)(7i) = 7m+i- Now apply fc to this to get /cj( 2 ™-i)(7/(o)) = 
7/( m ) and kj^-i) (7/(1)) = 7/(m+i)- Hence, kj^ m -i) maps the critical points between 7/(0) 
and 7/(1) to critical points between 7/( m ) and 7/( m +i), so we must have /(m + 1) — /(rre) > 
/(l) — /(0). This is clearly true for rre = as well. Adding these inequalities for rre = 0, 1, . . . , n — 1 
gives /(n) — /(0) > (/(l) — /(0))n, as desired. 

If /(n) - /(0) = (/(l) - /(0))n, then we must have had f(m + 1) - /(m) = /(l) - /(0) for 
all m < n, so we get /(re') — /(0) = (/(l) — /(0))re' for all n' < re. We now show that, for 
/(l) < r < /(re), bit number r from the right (the 2 r 's bit) of k mod 2-^ n ) +1 is 0; this will give 
mod 2^™' +1 < 2-fW. If r is /(rre) for some rre, then k{~f m ) = j r , so Lemma 2.9 states that 
bit number r of A; mod 2^ n ^ +l must be 0. Now suppose r is not one of the numbers /(rei); then 
r must lie between /(m) and /(rre + 1) for some rre, with 1 < rre < re. Since /(rre + 1) — /(rre) = 
/(l) — /(0), fcj(2 m -i) must map the entire list of critical points between 7/(0) and 7/(1) to the entire 
list of critical points between 7/( m ) and 7/( m +i). In particular, we must have A;j( 2 ™-i)(7s) = 7r 
for some s, so bit number r of kj( 2 ™~i) mod 2^( n ) +1 is 0. Since cr (kj( 2 ™-i)) = 7/(0) > we have 
fcj( 2 m_i) mod 2 / (°) +1 = 2 / (°), so bit number /(0) of kj {2m _ X) mod 2- / '( n ) +1 is 1. This means that 
subtracting 1 from kj^ 2 m_ 1 - ) mod 2^ n ~ >+1 does not require borrowing from bit number r, so bit 
number r of (fej(2™-i) mod 2-^™) +1 ) — 1 is 0. Therefore, by the remarks preceding Lemma 2.9, bit 
number r of k mod 2^ n ^ +1 is 0, as desired. ■ 

We will also need results guaranteeing the existence of embeddings whose critical sequences begin 
in particular ways. One such result is Theorem 3.1(a). Another is: For any re > 2, there exists 
k <G Aj such that k has critical sequence beginning 70 >— > 71 >— > 7„. This is proved by induction 
on re. For the base case, since k 2 = 72 (this follows from cr j( 4 ) = k 2 , which is proved by an 
easy computation [7, 2]), j has critical sequence beginning 70 1— > 71 ^72- For the induction step, 
if k has critical sequence beginning 70 1— ► 71 1— > 7„, then j( 2 ^)k has critical sequence beginning 
7o !-> 7l ^ 7n+l- 

Theorem 7.2. For any natural number n, the following are true: 

(a) For any m < n, there exists k in Aj such that k has critical sequence beginning 7 22 n_ 2 m+i 1— > 
72 2 ™— 2 m 1 — ^ 7 2 2 " • 
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(b) If s is a multiple of 2 2 ", and k mod 2 S+2 = 2 s x + 2 s - 1 where 1 < x < 2 2 — 1, 
i/ien A;(7 2 2« ) > 7 s+22 «+i • Furthermore, if also k mod 2 S+ > 2 S+ , £/ien fc(7 2 2« ) > 

( c ) J( 2 2 2 " -2) ^ as critical sequence beginning 71 1— > 7 22 « 1— > 7 22 «+i ■ 

Proof. By simultaneous induction on ra. The case n = of (a) is vacuously true. Throughout this 
proof, let N = 2 2 "; then N 2 = 2 2 " + \ 

First, suppose (a) is true for n. We prove (b) for n by downward induction on x. 

If x = 2 N — 2, then Proposition 6.6 gives £(70) = 7 S and fc(7i) > Js+n- Therefore, Lemma 7.1 
implies k(^ r ) > 7 s +jv r for all r, so /c(7jv) > 7 s+ tv 2 - Furthermore, if we have equality here, then 
Lemma 7.1 implies £(71) = 7 s +jv and k mod 2 S+Ar < 2 S+Ar . 

Now suppose 1 < x < 2 N — 2. Then Proposition 6.6 gives 7*, < £(70) < J s +n, and, if £(70) = 7 S , 
then fc(7i) < 7 s +at. 

Claim 1. T/iere exisi critical points a and (3 and an embedding k G Aj such that 7*. < k(a) < 
1s+n < k(P) and k has critical sequence beginning a 1— > 1— > 7^. 

Proof. We consider several cases. 

Case 1: ^(71) > 7 s +v- Then, since x < 2^ — 2, we must have 7 S < fc(7o) < 7 s +iv- Let a = 70 
and /3 = 71, and obtain A: as in the paragraph preceding the statement of Theorem 7.2. 

Case 2: £(71) < 7 s +at but fc(7 2 2«-i) > 7 s +at. Then there is a largest m such that k(-y 2 2ri - 2m ) > 
7 s _i_at, and this m is less than n. Let a = 7 22 ™_ 2 ™+i and /3 = 7 2 2"- 2 ™ , and apply (a) for n and m 
to get k. 

Case 3: A;(7 2 2"-i) < j s +n- Since A; mod 2 S+Ar has at most N — 1 O's among its lowest s + N 
bits, we have k(jN-i) > 7s+jv- Hence, there is a largest m such that k(^N-2 m ) > 7s+Af, and this 
m is less than 2 n — 1. Let a = 7 A r_ 2m +i (this is not 7 because m < 2 n — 1) and /? = 7jv_ 2 m, and 
apply Theorem 3.1(a) for 2 n and m to get k. □ 

Now ci (kk) = k(cvk) = k{a) is strictly between 7^ and 'Js+n, so kk mod 2 S+Ar = 2 s x' where 
1 < x' < 2^ — 1 and x' is even (so x' < 2^ — 1). By (2.2), we have x' > x. Therefore, we can 
apply the inner inductive hypothesis to get (kk + 2 s — 1)(7jv) > 7s+at 2 - But Corollary 6.4 gives 
kk( ls+N ) > (kk + 2 s - 1)(7jv), so 

A:( 7JV ) = k(k(j3)) = kk(k((3)) > kk( 7s+N ) > (kk + 2 s - l)( lN ). 

Therefore, fc(7jv) > 7s+n 2 - If we also have k mod 2 S+Af2 > 2 S+Ar , then (2.2) gives kk mod 2 s+n2 > 
it mod 2 S+JV >2 s+Ar ,sothe inner inductive hypothesis gives (kk + 2 s — 1)(7jv) > 7 s +at2 and hence 
^(7a0 > 1s+n 2 - This completes the proof of (b) for n. 

Next, we prove (c) for n from (b) for n. Previous results easily show that j(2 N -2) has critical 
sequence beginning 71 1— > 7^ (Theorem 2.1 shows that crj( 2 jv_ 2 ) = 7 1; and J(2 iV - 2 )(j(7o)) = 
.7(2" -1) (7o) = 7iv), so we must show that 3(2^-2)(1n) = In 2 - 

Define a numerical function h as follows: if the number r has binary expansion b m 2 m , where 
b m is or 1, then let h(r) = b m (2 ( - m+1 ' >N — 2 mN ). In other words, h(r) is obtained from r by 
replacing each or 1 bit in the binary expansion of r with a block of N O's or l's. 

Claim 2. For 1 < r < 2 N - 1, we have j{h(r+i)) ~' N = +1 3{h(r))3(2»-i)- 

Proof. Let m be such that 2 m \\ r + 1 (that is, m is the location of the rightmost 1 bit in r + 1). 
Let y = 2 N — 1 and let k be j^(r+i-2 m )) (the identity if r + 1 = 2 m ). Since 2 m || r + 1, we have 
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2 m+1 | r + 1 - 2 m , so 2( m+1 ) JV | fc(r + 1 - 2 m ), so crfe > j( m +i)N- Also, k + 2 mN y = j( h ( r +i)) 

and k + 2 mN — 1 = j(h(r))- Now Lemma 6.7 states that j(h( r +i)) — j(h(r))j(N-i), where 9 y = 
mmi <z < y (k + 2 mN z — 1)(jn)- But (b) for n gives (k + 2 m7V z - 1)(7jv) > Jn 2 for all such z, so 
^j/ > 7iv 2 +i) and we are done. □ 

Apply this for all r from 2 N -2 down to 1, noting that h(2 N - 1) = 2 N * - 1 and /i(l) = 2^-1, 
to get 

3(2" 2 -l) = j( h ( 2 "-l)) 

Tjv_2 + i . 

— 3(h(2"-2))3(2"-l) 

TAT2 + 1 . 

— 3(h(2"-3))3(2"-l)3(2"-l) 

7W = +1 i(2^-i)i(2~-i) • • • 3(2" -l) (2 N ~ 1 times) 

= 3(2" -2)3(2" -1) ■ 

Since j( 2 ^-2)(7o) = 7o, we have -2)3(2" -i)(lo) = 3(2"-2)j(2"-i)(j(2"-2)(lo)) = 3(2"-2){in)- 
On the other hand, j^ 2 " 2 _i) (7o) = In 2 - Since J^^-i) an< i 3(2" -2)3(2" -l) agree up to 7at 2 +i ; we 
must have j(2" -2){1n) = In 2 - Therefore, (c) holds for n. 

Finally, assuming (a) and (c) hold for n, we get (a) for n + 1 as follows. Given (c) for n, we can 
apply Lemma 7.1 to j(2"-2) to get j(2 N -2)(lm) = jNm for all m < N. Now, to get a for n + 1 
and m where m = n, use the embedding j(2 N -2)- For m < n, apply (a) for n and m to get k; then 
j(2"-2)(k) works for n + 1 and m. ■ 

So one may expect column 2 of the ultimate construction to be a sequence of embeddings 
bo, b\, . . . such that b n has critical sequence beginning 71 1— > 7 2 2« 1— > 7 22 "+i • 

8. Column 3? 

Since the first two columns of the regular and irregular constructions have worked out to be 
potentially optimal, it is natural to ask whether the third column does also. The third column in 
these constructions is a list of embeddings do,d\,... such that d n has critical sequence beginning 
£ 1-^ 1-^ (5 n 1— ► (5 n+ i. Given a list of n embeddings like this, one can generate a list of C 2 (n) 
embeddings of the column 2 type, where C 2 (n) is defined recursively by C 2 (0) = and C 2 (n+ 1) = 
C 2 (n) + 2 c ' 2 ( n ) [2, Lemma 4]. Hence, there are at least 2 2 ° 2( } critical points below 6 n . If such a 
column is to be optimal, we must have S n = 7 22 c 2 (n) . Also, note that member b C2 ( n ) of column 2 
is obtained by applying member d n of column 3 to the first member of column 1, which is just j; 
hence, if b C2 ( n ) 'looks like' j( m ), then d n should 'look like' j( m -i)- So the third-column situation 
seems likely to be as follows. 

Conjecture 8.1. For all n, i( 2 /(™)_3) has critical sequence beginning 70 1— > 71 1— > 7/( n ) l— ^ 7/(n+i)> 
where f(n) = 2 2 ° 2( 

For n = 0, this conjecture states that j has critical sequence beginning 70 1— > 71 1— > 7 2 1— > 74, 
which we already know to be true. We will now see that the conjecture is true for n = 1 as well; 
that is, 7(13) has critical sequence beginning 70 >— > 71 1— >• 74 1— > 7 2 56- Unfortunately, the methods to 
be used do not have an obvious generalization to later members of the column. 
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We will be using a number of results from section 5 of Dougherty [2]; for instance, it is shown 
there that j(i3) has critical sequence beginning 70 1— > 7! 1— > 74. Also, we have K3 = 74, and 
J(i3)( K 2) = i(i4)(^3)- These can be combined with results in the present paper, which state that 
J(i4)(74) = Tie, to give J(i3)(k 2 ) = 7i6- 

The proofs here will be written in terms of ordinary restrictions k \ V a , rather than Laver 

* . a 

restrictions kDV a and the corresponding equivalence relation =. This is not one of the methods 
that automatically works in the finite algebra context as well because of Proposition 2.7. However, 
this particular proof can be translated in a systematic way into a proof using Laver restrictions, 
which does work in the finite algebra context; for an example of this, see the two proofs of Lemma 15 
in Dougherty [2]. But the translated proof is substantially longer and less readable. 

If k(a) = 0, and (3 is a limit ordinal, then clearly k \ V a C Vp. However, if (5 is inaccessible 
(as all critical points are) and a < /3, then the size of A; \ V a is less than (3, so we actually get 
k \ V a G Vp. For instance, j \ V Kl G V K2 and j \ V K2 G V K3 . 

Straightforward computation [2, §5] gives the following inequalities (which are not in the strong- 
est possible form): 

j(K 2 )>K 3 J(5)(«l)>«2 j(9)(«l)>«2 

^ i(2)(«2)>K3 i(6)(«l)>«2 j(10)(«l) > «2 

j(3)(«2)>«3 J(7)(«l)>«3 j(ll)(Kl)>K3 

j(4)(«2)>K2 j(8)(^ 2 )>K 2 j(12)(K2) > «3 



We also have j(i3)(K 2 ) = 7i6 ( as noted above) and J(i4)(k 2 ) = 7s (from Theorem 7.2 and Lemma 
7.1). We can use these facts to expand out part of j(i4), as follows. Start with j( 14 ) \ V 116 = J(i3)(j f 
V K2 ) = j(i2)j(j \ V K2 ). Since j(i 2 )(« 2 ) > K3 and j \ V K2 G V K3 , we actually have j( 13 )(j \ V K2 ) = 
i(i2)(i t V K2 ){j \ V K2 ). Similarly, since j (11) («i) > re 3 , we get j {12) (j \ V K2 ) = j(n)(j \ V Kl )(j \ V K2 ). 
Continuing this leads to the following expansion: 

(8.2) i (14) r v 116 = j(j \ v K2 )u r v K2 )u r v Ka )u \ v K2 )u \ v K1 )u \ v K1 )u \ v K1 ) 

u \ v K2 )u \ v K1 )u \ v K1 )(j \ v K1 )u r v K2 )u r v K2 ). 



(fc I 2*^ m 1) ("74 ) 

Lemma 6.3 implies the following: If cr fc > 74™, then k + 2 Am — 1 = &°J( 2 4ra -i)- Since 

j f V K2 G Ky 4 , we can conclude that 

(8.3) fc(j( 2 *™-i)(j r kj) = (fe + 2 4m - i)(j r = (k + 2 4m ) r F( fe+24m _ 1)(K2 ). 

If we make the stronger assumption that cr k > 74 m+ 4, then we get a stronger conclusion: Lemma 
6.2 gives j( 2 4m_i } (7 4 ) = 74m+4, so j( 2 ^-i)(j \ V K2 ) G F 74m+4 , so 



(8.4) (k + 2 4m - i)(j r kj = fc(i( 2 --i)(i r v K2 )) = i (2 4 m _ 1} (j r v K2 ) = j {2 * m) \ y 74 



m + 2 ' 



These equalities can then be restricted to smaller domains, to get k(j^ 4m -i){j \ V Kl )) = (k + 2 Am ) \ 
V( fc+2 4m_ 1 )( Kl ) from (8.3), and similarly for (8.4). 

Lemma 8.2. // cr k > 4m + 4, then (k + 2 4m - l)(j (14) \ V J16 ) = (k + 14 ■ 2 4m ) f V (k+2 4 m _ 1)hl6) 
and (k + 2 4m - l)(i (15) r F 78 ) = (k + 15 ■ 2 4 ™) \ V {k+2 , m _ 1){ls) . 
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Proof. Plug in (8.2), use the distributive law, and then apply (8.4) and (8.3) repeatedly: 

(fe + 2 4m -l)(j (14) W 116 ) 

= (k + 2 4m - \ V K2 )(j r V K2 ) ■ ■ ■ (j \ V K2 )) 

= (k + 2 im )((k + 2 4m - l)(j r V K2 ))((k + 2 4 ™ - \ V K2 )) ■■■((k + 2 4 ™ - \ V K2 )) 

= (k + 2 4m )(i (2 4 m _ 1} (j r ^))(j (2 --i)(j r kj) • • • (i (2 --i)(j r kj) 

= (fc + 2 ■ 2 4m )(j (24m _ 1) (j |- KJ) • • • (i (2 --i)(i r KJ) 

= (fc + 3-2 4m ).--(i (24m _ 1) (j fKj) 

= (fc + i3-2 4 " i )(j (24m _ 1) (i rKj) 

= (fc + 14-2 4m ) rV (fe+13 . 2 ^)( K4m+2 ) 

But clearly two functions which are equal must have the same domain, so (A; + 2 4m — 1)(V^, 16 ) = 
^(fc+i3-2 4m )(K 4 +2 ) and we get the desired result. The argument for j( 15 ) is the same with one more 
UW K2 ). U 

We now use the function h defined in the proof of Theorem 7.2, letting N = 4: h(r) is obtained 
from r by quadruplicating every bit in the binary expansion of r. So h(l) = 15, h(2) = 240, 
h(U) = 2 16 - 16, and so on. 

We can now show that, for any r, j( h ( r )){j(i5) \ V 1& ) = 3{h{r+i)) \ V j (h(r)) hs)- To see tnis > let 171 
be such that 2 m \\ r + 1, and apply Lemma 8.2 with k = j(h(r+i-2 m ))- Again, if we want, we can 
simply restrict both sides of this equality further to get j{h(r)){j(\5) \ V l4 ) = 3(h(r+i)) \ Vj (h(r)) (-/ 4 )- 

Now we can use (8.2) again, to get 

.7(13) (jj) f*j- (1 8)(K3) =i(i3)(i(j \v K2 )) 

= i(14)0'(14) \Vjie) 

= i ( i4)(i(j W K2 )(j \ v K2 )---(j \ v K2 )) 

= j(h(i))(j(i5) \V JS )(j {15) \ V 18 ) ■ ■ ■ (j {15) \V 18 ) 

= 3(h(2))(j(15) \ V ls ) ■ ■ ■ (j(i5) \V ls ) 

= 3(h(3)) ■ ■ ■ (.7(15) r Vis ) 

= i(ft(13))(i(15) T ^78 ) 
= 3\h{lA)) \ ^ (M13)) ( 7S ). 

Hence, by another application of Lemma 8.2, 

j(13)(j(3)) \ ^ ( i 3 )(«3) = J(i3)(i(i \ V ^)U \ K 2 )) 

= .7(13) (jO' \ Kj)(.7(14) I" *7ie) 
= J(/i(14))(j(14) T V" 7l6 ) 
= 3(2^-2) \ V J(h(14)) { 116 )- 

We can now apply this to 716 to get 

i( 2 i6_ 2) (7i 6 ) =i(13)(i(3))(7l6) = i(13)(i(3))(j(13)( K 2)) = j(13)(«3)- 
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But Theorem 7.2 gives j(2 16 -2)(7ie) = 7256, so we have j'(i3)(«3) = 7256, as desired. 

We can now use this evaluation of J(i3)(tt3) to compute some other critical points. For instance, 
since 7(7) (f^), i(i2)( K 3), an d J(i3)(^3) are equal to each other [2, §5], they are all equal to 7255- A 
more interesting fact is as follows: 

Proposition 8.3. The 256 critical points below k 4 produced by the regular construction [2, §2] are 
the first 256 critical points 70, 71 , ... , 7255 • 

Proof. Suppose we modify the regular construction by replacing the first entry in column 4, which 
is normally = jj(2), with J(i2)- Then this modified construction will produce 256 critical 

points below j(\2)( K 3) = 7256, so these must be the first 256 critical points. But j( 12 ) J(7 = ^ 
33(2) [2, §5], so the regular construction and the modified construction produce exactly the same 
results below 7(7) (^2) = 7256; hence, the regular construction also produces the first 256 critical 
points. ■ 

Since the irregular construction [2, §4] actually has 3(12) as the first member of column 4, it 
produces the first 256 critical points as well as 7256 itself. It is possible that the irregular construc- 
tion produces a very large initial segment of the list of critical points; if this is so, we would get 
equations like .7(11) (73) = 7 2 2 1X an d j(n)(74) = 7 2 2 2059 - 

9. Conclusion 

The results of this paper give some hope that an optimal recursive construction producing all 
critical points is possible. If the patterns shown so far continue, then one can expect column 
number N to contain embeddings which 'look like' j(f( n )-N): where f(n) is a power of 2; this gives 
some guidance as to the critical points patterns these columns should satisfy. 

However, even if such a construction is completed, one will not get a computation of the critical 
point counting function F unless one can tell where the critical points n n occur in the construction. 
Since the function F grows not only faster than the Ackermann function, but faster than a slow 
iterate of the Ackermann function [2], it must be the case that the number of columns between the 
places where the critical points n n occur is also a fast-growing function. This is a sign that there 
are still more complications to occur later in the construction. 
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